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Abstract 

In the paper we show that the bibliographic data can be transformed into a 
collection of compatible networks. Using network multiplication different in- 
teresting derived networks can be obtained. In defining them an appropriate 
normalization should be considered. The proposed approach can be applied 
also to other collections of compatible networks. We also discuss the ques- 
tion when the multiplication of sparse networks preserves sparseness. The 
proposed approaches are illustrated with analyses of collection of networks 
on the topic "social network" obtained from the Web of Science. 
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1. Introduction 

A collaboration network is usually defined in the following way. The set 
of network's nodes consists of authors. There exists an edge (undirected link) 
between authors u and v iff they produced a joint work (paper, book, report, 
etc.). Its weight w(u,v) is equal to the number of works to which u and v 
both contributed. 

In this case a more basic network is a two-mode network linking the set of 
works with the set of authors. There is an arc (directed link) from the work 



*University of Ljubljana, FMF, Department of Mathematics, Jadranska 19, 1000 Ljubl- 
jana, Slovenia 

**Hruska d.o.o., Kajuhova ulica 90, 1000 Ljubljana, Slovenia 
Email addresses: vladimir. batagelj Of mf .uni-lj .si (Vladimir Batagelj), 
monikaOhruska. si (Monika Cerinsek) 

URL: http://pajek.imfm.si (Vladimir Batagelj) 



Preprint submitted to Scientometrics 



January 22, 2013 



p to the author u iff u is an author of the work p. It is well known that if we 
represent this two-mode network with a matrix WA then we can compute 
the matrix of the corresponding collaboration network as WA T * WA using 
matrix multiplication. 

The problem with matrices of large networks is that they require in their 
standard representation too much computer memory although most of their 
entries are zero. For this reason we introduce a 'parallel' operation of network 
multiplication that deals only with nonzero elements. 

For a given set of works, besides the two-mode network WA on works x 
authors, we can construct other two-mode networks such as WK on works x 
keywords, WC on works x classifications, WJ on works x journals, etc. Since 
these networks have the same first set - the set of works, we can obtain from 
them using multiplication different derived networks. For example WA T * 
WK = AK gives us the two-mode network AK on authors x keywords with 
the weight of the arc (u, k) counting in how many works the author u used 
the keyword k. Additional derived networks can be produced considering 
also the one-mode citation network Ci between works. 

In the paper we first show that we can transform any data table into 
a collection of corresponding two-mode networks. Afterwards we introduce 
the network multiplication and discuss the question when it preserves the 
sparsity of networks. Since the networks from the collection are compatible 
- they share a common set - we can obtain, using multiplication, different 
derived networks. The main part of the paper deals with the problem of 
'normalization' of the weights in the derived networks which is illustrated 
with the case of collaboration networks. The described approach can be 
used also for other derived networks. In the last part of the paper some 
other derived networks for the case of bibliographic networks are presented. 

The introduced concepts are illustrated on the network data set SN5 ob- 
tained in 2008 from the Web of Science for a query "social network*" 
and expanded with existing descriptions of the most frequent references 
and the bibliographies of around 100 social networkers. Using the program 
WoS2Pajek (Batagelj, 2007) the corresponding collection of network data 
was produced: the networks works x authors, works x keywords, cita- 
tion network; partition of works by publication year, and the DC partition 
distinguishing between works with complete description and the cited only 
works. The sizes of the sets are as follows: works \W\ = 193376, works with 
complete description |C| = 7950, authors \A\ = 75930, journals \ J\ = 14651, 
keywords \K\ = 29267. The data set was used for the Viszards session at the 



2 



SunBelt XXVIII, January 22 - 27, 2008, St Pete Beach, Florida. Analyses 
were made in a program Pajek (Pajek wiki (2012)), a tool for analysis and 
visualization of large networks. 

2. Two-mode networks and network multiplication 

2.1. Two-mode networks from data tables 

A data table T is a set of records T = {Tk : k G /C}, where /C is the set 
of keys. A record has the form Tk = (k, qi(k), q2(k), . . . , q r (k)) where qi{k) is 
the value of the property (attribute) qt for the key k. 

Suppose that the property q has the range 2 s . For example, for Wasser- 
man and Faust (1994) : 

Authors(SNA) = { S. Wasserman, K. Faust }, 
PubYear(SNA) = { 1994 }, 

Keywords(SNA) = { network, centrality, matrix, . . . }, ... 

or for Wasserman and Faust (1994); Batagelj and Zaversnik (2011); De Nooy, 
Mrvar and Batagelj (2012); Zaversnik and Batagelj (2004); Kejzar, Korenjak 
Cerne and Batagelj (2010): 



work 


authors 


year 


SNA 


S. Wasserman, K. Faust 


1994 


GenCores 


V. Batagelj, M. Zaversnik 


2011 


Islands 


M. Zaversnik, V. Batagelj 


2004 


ESNA2 


W. de Nooy, A. Mrvar, V. Batagelj 


2012 


IFCS09 


N. Kejzar, S. Korenjak, V. Batagelj 


2010 



Here work is a key, and authors and year are properties. 

If Q is finite we can assign to the property q a two-mode network /C x q = 
(/C, Q,A,w) where (k,v) e A iff v e q(k), and w(k,v) = 1. Note that the set 
Q can always be transformed into a finite set by partitioning it and recoding 
the values. 

Single-valued properties can be represented more compactly by a parti- 
tion. 

For data from the Web of Science (Knowledge) we can obtain the corre- 
sponding networks using the program WoS2Pajek (Batagelj, 2007). Similar 
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programs exist also for other bibliographic data sources /formats: BiBTYrX., 
DBPL, IMDB, Zentralblatt Math, and others. 



2.2. Multiplication of networks 

The product of two compatible networks is essentially the network corre- 
sponding to the product of matrices corresponding to the given networks; or 
in more formal words: 

To a simple (no parallel arcs) two-mode network M = (X, J , A, w); where 
X and J are sets of nodes, A is a set of arcs linking X and J ', and w : A — > R 
is a weight; we can assign a network matrix W = [wij] with elements: 
Wij = w(i,j) for G .4. and w^j = otherwise. 

Given a pair of compatible two-mode networks Ma — (X, /C, .4/1, w^) and 
Ab = (K<, J ,Ab,w b ) with corresponding matrices A Xx k and B^ x ,7 we call 
a product of networks Ma and A/"b a network Ac = (X, J7", «4cs where 
-4c = {(«, j) : i G X,j G J,Cij ^ 0} and w c (i,j) = c itj for (i,j) G A C - The 
product matrix C = [cij]x x j = A * B is defined in the standard way 



In some applications we have to consider the product on other semirings than 
the standard (R,+, -,0, 1) (Batagelj, 1994). 

In the case when X = K = J we are dealing with ordinary one-mode 
networks with square matrices. 

Note that in the expression (1) to the value contribute only the terms 
a>i,k'bk,j m which both factors and bkj are nonzero. For A^(i)UiV^(j) ^ 
we have 



where NA{i) are the successors of node i in network Ma and N^(j) are the 
predecessors of node j in network Mb- 

Therefore, if all weights in networks Ma and Mb are equal to 1 then the 
product a i; k ■ bkj G {0, 1} and the value of counts the number of ways we 
can go from % G X to j G J passing through /C. 

The standard matrix multiplication has the complexity 0(|X| • \K\ ■ \J\) - 
it is too slow to be used for large networks. Most of large networks are sparse 
- their matrices contain much more zero elements than nonzero elements. For 
sparse large networks we can multiply much faster considering only nonzero 
elements. 




(1) 





k£N A (i)UN-(j) 
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Figure 1: Network multiplication 

for k in K. do 

for in N A (k) x N B (k) do 

if 3d :j then cy := Cjj + • 6 fc j 
else new cy := ■ b^j 

In general the multiplication of large sparse networks is a 'dangerous' 
operation since the result can 'explode' - it is not sparse. 

From the network multiplication algorithm we see that each intermedi- 
ate node k e K adds to a product network a complete two-mode subgraph 
^■N-(k),N B (k) ( or > m the case X = J~, a complete subgraph Kn^). If both 
degrees deg^(A;) = |iVj(A;)| and deg B (A;) = |JVb(A;)| are large then already 
the computation of this complete subgraph has a quadratic (time and space) 
complexity - the result 'explodes'. 

It is easy to see that if at least one of the sparse networks Ma an d Mb 
has small maximal degree on /C then also the resulting product network Mo 
is sparse. 

We shall prove a stronger result that if for the sparse networks Ma and Mb 
there are in K, only some vertices with large degree and no one among them 
with large degree in both networks then also the resulting product network 
Mc is sparse. 



5 



Let 

dmin(k) = min(deg A (£;),deg B (£;)) and d max (k) = m&x(deg A (k),deg B (k)). 
Then 

deg A (k) ■ deg B (k) = d min (k) ■ d max {k) 
Define also A min = max feej »c d min (k) and 

K{d) = {ke)C: d max {k) > d} 

Let us denote d* = argmin d (|/C(rf)| < d) and K* = IC(d*). Then |/C*| < d* 
and the number of nonzero elements in the product 

C deg A ( k ) ■ deg B (/c) = d min (k) ■ d max (k) 

^ ^ dmini}^) ' d max {k^j + ^ ^ d m i n (Jz) ■ d max (Jz) 

k<=K* k&K\K* 

— A-min ' ^ ^ d max (k} + (i ■ ^ ^ d m i n (Ji) 

ketc* keic\K* 
< d* ■ (A mm • max(|X|, \J\) + min(|^|, |.A B |)) 

Therefore: 

If for the sparse networks Ma and Mb the quantities A min and d* 
are small then also the resulting product network Mc is sparse. 

That is equivalent to the claimed result. 



3. Collaboration 

3.1. Co-authorship networks 

Let WA be the works x authors two-mode co-authorship network; wa pi G 
{0, 1} is describing the authorship of author i of work p. Then for each work 
p E W : 

wa P i = outdegfjo) 

ieA 

The outdeg(jo) is equal to the number of authors of work p. 
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Let N be its normalized version with n pi describing the share of contri- 
bution of author i to work p such that for each work p G W : 

J2 n P* e {0,1} 

The sum has value for works without authors. 

The contributions be determined by some rules or, assuming that 

each author contributed equally to the work, it can be computed from WA 

as 

wa pi 

pJ max ( 1 , outdeg (p) ) 

A similar normalization of collaboration links, but with outdegfjo) — 1 instead 
of outdeg(jo), was proposed already by Newman (2001). He is interpreting 
the weight as a proportion of time spent for the collaboration with each 
co-author. 

Row-normalization n(Af) is a network obtained from Af in which the 
weight of each arc a is divided by the sum of weights of all arcs having the 
same initial node as the arc a. For binary network A on X x J 

n(A) = diag ( — - — - — -—) *A 

Vmax(l, outdeg(2)) / ieX 

Therefore we can obtain the normalized co-authorship network as 

N = n(WA) 

In some sense reverse transformation is the binarization b(N) of the M: 
it is the original network in which all weights are set to 1. It holds 

WA = 6(N) 

and if N was obtained from WA also WA = b(n(WA)). 

Another useful transformation is the transposition. Transposition Af T or 
t(Af) is a network obtained from M in which to all arcs their direction is 
reversed. For bibliographic networks we introduce the abbreviations AW = 
WA T , KW = WK ' etc. 
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3.2. The first collaboration network 

A standard way to obtain the collaboration network Co from the co- 
authorship network using network multiplication is 

Co = AW * WA 

From 

COij — WdpiWCLpj = 1 

p&w peN(i)nN(j) 

we see that co^- is equal to the number of works that authors i and j wrote 
together. 

The weights in the first collaboration network are symmetric 

COij = wa P iWa P j = wa P jWa P i = coji 

p&W p&W 

One can search for authors with most collaborators. Such authors in 
the set SN5 are listed in Table 1. On the other hand Table 2 shows the 
distribution of output degree of authors in the set SN5. Output degree of 
each author is equal to the number of works he/she co-authored. 



Tabic 1: List of the authors with the largest number of different collaborators in SN5 



i 


author 


collaborators 


i 


author 


collaborators 


1 


Snijders,T 


77 


11 


Rothcnbcrg,R 


58 


2 


Krackhardt,D 


71 


12 


Dorcian,P 


56 


3 


Wasscrman,S 


65 


13 


Breiger,R 


56 


4 


Fcrligoj,A 


63 


14 


Valente,T 


52 


5 


Bcrkman,L 


63 


15 


Butts, C 


52 


6 


van Duijn,M 


63 


16 


Goodreau,S 


52 


7 


Donovan, D 


62 


17 


Draper, D 


51 


8 


Friedman, S 


60 


18 


Batagelj , V 


51 


9 


Latkin,C 


59 


19 


Barabasi,A 


51 


10 


Faust ,K 


59 


20 


Kclly,J 


50 



The obvious question is: who are the most collaborative authors? The 
standard answer is provided by /c-cores, Batagelj and Zaversnik (2011). 

A subset WCVof nodes determines a k-core C = (U, C\U) in the network 
Af = (V, C) iff for each node u e U it holds deg c (w) > k and the set U is the 
maximal such set. The subset of links C\U consists of links from C that have 
both end- nodes in U. 
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Table 2: Outdegree distribution in WA(SN5) 



outdcg 


frequency 


outdcg 


frequency 


paper 


1 


2637 


12 


8 




2 


2143 


13 


4 




3 


1333 


14 


3 




4 


713 


15 


2 




5 


396 


21 


1 


Pierce et al. (2007) 


6 


206 


22 


1 


Allen ct al. (1998) 


7 


114 


23 


1 


Kelly ct al. (1997) 


8 


65 


26 


1 


Semple et al. (1993) 


9 


43 


41 


1 


Magliano ct al. (2006) 


10 


24 


42 


1 


Doll ct al. (1992) 


11 


10 


48 


1 


Snijders et al. (2007) 



In a collaboration network a /c-core is the largest subnetwork with the 
property that each its author wrote a joint work with at least k other authors 
from the core. 

In Figure 2 the cores of orders 20-47 are presented. From this figure we can 
see a serious drawback of directly applying cores for analysis of collaboration 
networks. A work with k authors contributes a complete subgraph on k 
vertices to a collaboration network. For the bibliographies with works with 
large number of authors the cores procedure identifies as the highest level 
cores the complete subgraphs corresponding to these works, and not the 
groups of really the most collaborative authors, as one would expect. 

For the SN5 bibliography the components of the cores of orders 20-47 
are induced by the papers Snijders et al. (2007); Doll et al. (1992); Magliano 
et al. (2006); Semple et al. (1993); Kelly et al. (1997); Allen et al. (1998); 
Pierce et al. (2007) that correspond to the works with the largest number 
of authors (21-48), see Table 2 and Appendix A. In the picture only the 
names of authors that are the end-nodes of links with weight larger than 1 
are displayed. 

An approach to deal with this problem would be to remove all links with 
weight 1 (or up to some other small threshold) and apply cores on the so 
reduced network. 

A better solution is to identify the works with (too) many authors - very 
high outdegree in the network WA - and, for this analysis, remove them from 
the network WA. We can review the removed works separately. 

Yet another approach is to apply on the collaboration network Co the 
Ps-cores (Batagelj and Zaversnik, 2011) - a generalization of the ordinary 
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Figure 2: Cores of orders 20-47 in Co(SN5) 



cores in which the degree deg c (w) is replaced by the sum of weights of links 
from u to other nodes in U 



A subset U C V of nodes determines a ps-core at level t C = (U, C-\U) in 
the network Af = (V, C) iff for each node u G IA it holds ps(u,U) > t and 
the set U is the maximal such set. 

In Figure 3 the ps-core at level 20 is presented. Each author belonging 
to it has at least 20 collaborations with other authors inside the core. 

Again in the network SN5 the cliques corresponding to papers with the 
largest number of authors appear in the ps-core. Besides them we get also 
some strongly collaborating groups such as: { S. Borgatti, M. Everett }, { H. 
Bernard, P. Killworth, C. McCarty, E. Johnsen, G. Shelley }, { R. Rotenberg, 
S. Muth, J. Potterat, D. Woodhouse }, { L. Magliano, M. Maj, C. Malangon, 
A. Fiorillo }, and others. 
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Figure 3: p s -core at level 20 of Co(SN5) 
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To neutralize the overrating of the contribution of works with many au- 
thors we can try with alternative definitions of collaboration networks using 
the normalized co-authorship network. The structure (graph) of the collab- 
oration network remains the same, but the weights change. 

3.3. The second collaboration network 
is defined as 

Cn = M/V * N 

The value of the weight cn^ 



CTlij ^ ^ WJQjpiTLpj ^ ^ ^pj 

pew P eN(i)nN(j) 



is equal to the contribution of author j to works, that he/she wrote together 
with the author i. 

In general the entries of matrix Cn need not to be symmetric (cn^ = 
cnji). In the case when n P i = outd cg^( p ) they are 

™>ij= E n Pi= E outdeg WA (p) = ^ n v* = ™ Tl 
The total contribution of terms wa P in P j for a work p from the definition 

of CUij 

jeA jeA ieN( P ) jeA ieN( P ) 

is equal to the number of authors of the work p. 

Similary, for an author i the total contribution of entries cnij 

^ cnij = E E wa P iU pj = E wapi E Up i = E wa P i = inde SwA W 
jeA jeA pew pew jeA pew 

is equal to the number of works that the author i co-authored; and the 
(diagonal) entry 

peN(i) 
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Table 3: List of the "best" authors in SN5 





si lit h t~iv 




total 




1 


n> u.1 l , ix 


43.83 


53 


0.173 


2 


liuW IllOjIl . 1V1 


36.77 


60 


0.387 


3 


T)fM"pinn P 


34.44 


47 


0.267 


4 


J_> U 1 1 d 1 U 1 1 . IT 


30.17 


41 


0.264 


K 

o 


iVi3xsdcii,P 


29 42 


^7 
o i 


90^ 


Q 


V V t-lllllclll, J_J 


26.87 


41 


0.345 


7 




24.37 


35 


0.304 


g 


Wlnitp H 


23.50 


33 


0.288 


g 


T7riprlL~in NT 


20.00 


23 


0.130 


10 


DDI gcLt LI, O 


19.20 


41 


0.532 


11 


Everett M 


16.92 


31 


0.454 


12 


Litwin,H 


1 fi nn 


91 


9^£ 


13 


J_ I t_-L-lilclll, J_j 


15.53 


90 


0.223 


14 


RamrmQi A 


14.99 


35 


0.572 


15 


^vTl 1 1 arc ' 1 1 
Ollljt-lt;! O, _L 


14.99 


30 


0.500 


16 


Valpntp T 


14.80 


34 


0.565 


17 


t-\ rpi <T £i V* T? 


14.44 


20 


0.278 


18 




14.43 


27 


0.466 


1 Q 


Kro-ckhctrdt ,D 


lO.UO 


9^ 


zl^zl 


20 




12.93 


28 


0.538 


21 


Ratticrm P 


12.10 


27 


0.552 


22 


V V d&sLl lilelll, O 


11.72 


26 


0.549 


23 


RprVwiEiVI T, 


11.21 


30 


0.626 


24 


iviuuuy) j 


10.83 


15 


0.278 


25 


Srott 1 


10.47 


15 


0.302 


26 


T atkin C 


10.14 


37 


0.726 


27 


lVTnrric, 1VT 


9.98 


20 


0.501 


9£ 


R,othcnbcrg,R, 


Q 89 
y .oz 


9R 


fizLQ 


29 


JTVeMJ. Liblllll, V_/ 


9.75 


11 


0.114 


30 


J. CLLIS L jIV 


9.72 


18 


0.460 


31 


Ratao-pli V 


9.69 


20 


0.516 


32 


A/Twrnr'Vii A/T 

1V_L1ZjI LH_-I11 1 1v± 


9.67 


15 


0.356 


33 


[Anon] 


9.00 


9 


0.000 


34 


Johnson, J 


8.89 


21 


0.577 


35 


Fararo,T 


8.83 


16 


0.448 


36 


Lazcga,E 


8.50 


12 


0.292 


37 


Knokc,D 


8.33 


11 


0.242 


38 


Ferligo j , A 


8.19 


19 


0.569 


39 


Brewer, D 


8.03 


11 


0.270 


40 


Klovdahl,A 


7.96 


17 


0.532 


41 


Hammer, M 


7.92 


10 


0.208 


42 


White,D 


7.83 


15 


0.478 


43 


Holme, P 


7.42 


14 


0.470 


44 


Boyd,J 


7.37 


13 


0.433 


45 


Kilduff,M 


7.25 


16 


0.547 


46 


Small,H 


7.00 


7 


0.000 


47 


Iacobucci,D 


7.00 


12 


0.417 


48 


Pappi,F 


6.83 


10 


0.317 


49 


Chen.C 


6.78 


12 


0.435 


50 


Scidman,S 


6.75 


9 


0.250 
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is equal to the total contribution of author i to his/her works. 

Therefore we can base on the entries of matrix Cn the self-sufficiency 
index 

CTla 



indeg WA (i) 

as the proportion of author's contribution to his/her works and the total 
number of works he/she co- authored. 

The collaborativness index is complementary to it 



Ki = l- Si 



All criij values 



Yl Yl CUi i = inde SwA W = m WA 

ieA jeA ieA 

sum up to the number of all links in the network WA. 

In Table 3 the 50 authors with the largest self-contribution cria to the 
topic of 'social network analysis' are presented together with the total number 
of works on the topic that an author co-authored, and his/her collaborativ- 
ness index. 

3.4- The third collaboration network 
is defined as 

Ct = N T * N 

The weight ctij is equal to the total contribution of collaboration of au- 
thors % and j to works. 

The total contribution of a complete subgraph corresponding to the au- 
thors of a work is 1: 

J2J2 n l n Pi = ^2 n pi^2 r hj = J2 n p* ■ 1 = 1 

ieA jeA ieA jeA ieA 

The weights ct^ are symmetric 

C ^ij = ^ ] n ip n pj = ^ j n j p n pi = dji 

pew pew 



and the sum 



c tij ~ n P in Pi ~ n ' pi n ' p 'j ~ 

jeA jeA pew pew jeA -pew 
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is equal to the total contribution of author i to works from W. 
The sum of all weights cUj 

i&A jeA ieA peW pGW ieA p£W 

is equal to the number of all works. 

We can also introduce the author's contribution to the field as 



1^4 

aCl = W ' " 



"/<< 

pdW 



with the property 

= \A\ 

ieA 

Therefore the average ac is 1. 
Note also that 

6(Co) = 6(Cn) = 6(Ct) 



In Figure 4 the p^-core of order 0.75 in the third collaboration network 
Ct(SN5) is presented. In this core the large cliques disappear. The largest 
core's component consists of the main stream social networks researchers 
with the most intensive collaboration pairs: Borgatti and Everett, Killworth 
and Bernard, Bonacich and Bienestock, Ferligoj and Batagelj, Pattison and 
Robins, etc. The second largest component consists of physicists with more 
intensive collaboration pairs: Newman and Park, Barabasi and Albert, and 
Masuda and Konno. In the smaller components we find additional three 
pairs: Lienhard and Holland (with Lienhard represented with two nodes), 
Metzke and Steinhaus, and Chou and Chi. 



4. Derived networks 

4-1. Bibliographic Coupling and Co-Citation 

In WoS2Pajek the citation relation means pCiq = p cites q. Therefore 
the bibliographic coupling network biCo can be determined as (Kessler, 1963) 

biCo = Ci * Ci T 
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Figure 4: ps-core of order 0.75 in the third collaboration network on Ct(SN5) 



16 



The corresponding weight 



blCOpq ^ ^ Clpsdqs ^ ^ 1 

seW seN(p)r\N(q) 

is equal to the number of works cited by both works p and q. It is symmetric 
bicOp q = bico qp . 

Again we have problems with works with many citations, especially with 
review papers. To neutralize their impact we can introduce a normalized 
measure such as 

biCon = hn(Ci) * Ci T + Ci * n(Ci) T ) 

It is easy to verify that bicon pq G [0, 1] and bicon pq = bicon qp (symmetry). It 
also holds: bicon pq = 1 iff the works p and q are referencing the same works. 
Note that 

6(n(Ci) * Ci T ) = 6(Ci * n(Ci) T ). 

The cC pq element of the first term represents the 'importance' of com- 
mon (p, g)-citations for the work p; and the Cc pq element of the second term 
represents the 'importance' of common (p, g)-citations for the work q. 

bicoripg = -(cC pq + Cc pq ) 

Note that the first term in the definition of biCon is equal to the transpose 
of the second term 

(Ci*n(Ci) T )f = n(Ci)*Ci T 

and therefore Cc pq = cC qp . This can be used for more efficient computation 
of biCon. We only need to compute the first term cC. Then 

bicon pq = ^(cC pq + cC qp ) 

In the network biCon(SN5) the larger components with edges with bicon = 
1 correspond to papers with a single reference to a book ( Wasserman, S., 
Faust, K.: Social network analysis. Cambridge UP, 1994; Taylor, Howard F.: 
Balance in small groups. Van Nostrand Reinhold, 1970; Belle, D.: Childrens 
social networks and social supports. Wiley, 1989; Gottlieb, B. H.: Social 
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networks and social support. Sage, 1981; Yan, Yunxiang: The flow of gifts. 
Stanford UP, 1996; Zhang, L.: Strangers in the City. Stanford UP, 2001). 
There are also several pairs of papers with bicon = 1, mostly written by the 
same author. More interesting groups we can obtain as larger islands with 
values below 1. 

Similary the document co-citation network coCi can be determined as 
(Rosengren, 1968; Small, 1973) 

coCi = Ci T * Ci 



The corresponding weight 



^ ^ dspCigq ^ ^ 1 

seW s€N-(p)nN-(q) 



is equal to the number of works citing both works p and q. N~ (p) denotes 
the set of neighbors from which the node p can be entered. 

It holds coCi(AT) = biCo(A/" T ) and also for corresponding normalized 
networks coCin(A0 = biCon(A/" T ). 

4-2. Other derived networks 

The weight aci ip in the author citation network 



ACi = AW * Ci 



counts the number of times author i cited work p. 
The author co-citation network can be obtained as 

ACo = 6(ACi) * 6(ACi) T 

The weight acoij counts the number of works cited by both authors % and j. 
The weight akik in the authors using keywords network 

AK = AW * WK 



counts the number of works in which the author i used a keyword k. 
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4-3. The cited co-authorship network 

Quattrociocchi et al. (2011) proposed the cited co-authorship network: 



AW * diag(indeg Ci (j»)) * WA 

the weight of two collaborating authors is equal to the sum of numbers of 
citations to co-authored works where indeg Ci (p) is number of citations to 
work p. 

Its normalized version would be: 

Cc = AW*diagf ^tl VwA 
Voutdeg Ci (p)V 



with the properties 

, 

* p outdeg Ci (p) 2 



v^v^ indeg Ci (p) 
l^l^wa ip — — j-r- 2 aw pj = mdeg Ci (p) 



ieA jeA 
and 

Y Y CCi i = Y inde gci(p) = \Aa\ 

i€A jeA p&W 

where |^4ci| is the number of arcs in the citation network Ci. 

4-4- Authors' citations network 

The network of citations between authors can be obtained as 

Ca = AW * Ci * WA 

The weight ca^ counts the number of times a work co-authored by % is citing 
a work co- authored by j, see Figure 5. 

In Figure 6 some link islands from Ca(SN5) are presented. The largest 
island consists of the main stream social networks researchers with some 
subgroups: the star around R. Burt in the top left part; the S. Borgatti and 
M. Everett tandem in the bottom left part; the probabilistic group in the 
top right part with G. Robins, P. Pattison, T. Snijders, S. Wasserman, and 
P. Holland as the most prominent; and others: J. Skvoretz, D. Krackhardt, 
P. Doreian, R. Breiger, H. White, L. Freeman, and P. Marsden. 

The "scale-free" island consists mainly of physicists M. Newman, A. 
Barabasi, D. Watts, R. Albert, P. Holme and others. In the "medical" island 
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Figure 5: Authors' citations network 

the central authors are J. Potterat, R. Rothenberg, D. Woodhouse, S. Muth, 
A. Klovdahl, and S. Friedman. There is also an island on "education and 
psychology" with T. Farmer, R. Cairns, B. Cairns, H. Xie, and P. Rodkin. 

Most of the other islands are star-like, usually a professor with his Phd 
students. 

5. Conclusions 

In the paper we showed that the bibliographic data can be transformed 
into a collection of compatible networks. Using network multiplication dif- 
ferent interesting derived networks can be obtained. In defining them an 
appropriate normalization should be considered. The proposed approach 
can be applied also to other collections of compatible networks (see Batagelj, 
2009, pg. 8260-8262). 

Note that most of the obtained derived networks are one-mode networks. 
To analyse them standard SNA methods can be used. For analysis of two- 
mode networks we can use direct methods such as (generalized) two-mode 
cores, two-mode hubs and authorities and 4-rings islands (Ahmed et al., 
2007). 

We can also transform the citation network (and other WoS networks) 
into temporal network using the partition of works by publication year. Using 
the time slices also the temporal sequences of corresponding derived networks 
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Figure 6: Some link islands in the network Ca(SN5) 
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can be obtained. 

Pajek allows analyses on different levels specified by a partition of the 
corresponding set of units and obtained using the shrinking of classes. For 
example: partition of authors by institutions, or partition of institutions by 
countries, partitions of authors by discipline/ field/ subfield, etc. Using the 
extraction of selected classes we can reduce the network to the area of our 
interest. 

The HOW TO in Paj ek for the described approach is available at 
http : //pajek. imfm. si/doku.php?id=how_to :biblio 
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